A Weighing Based Feature Selection and Monotonic Classification (WFSMC) for the Effective Prediction of Outliers
نویسنده
چکیده
Advancement in computer technologies in data processing explores the capabilities of research, communication and services in recent years. Knowledge Discovery on Database (KDD) is an essential process on data processing. Data mining is one of the form of KDD. Many feature selection and classification algorithms are used to select the relevant features and classify them according to criteria in data mining applications. The enormous quantity of high dimensional data leads to challenge in traditional data mining techniques. The employment of classifiers in data has deteriorated the performance of machine learning process since it sensible to noise, relevant or irrelevant features. The ordering of decision values on the basis of relevant features improve the performance of the classification process. In this paper, Weighing based Feature Selection and Monotonic Classification (WFSMC)are proposed to know the importance of relevant feature and design the feature selection based on constraints. This paper discusses the new feature weighing classifier on the basis of imputation methods in two ways: 1) Estimation of new distribution values for each feature, 2) Statistical test to measure the changes between original and distributed values. Based on the principle of large margin, the new feature selection algorithm evolved by introducing the monotonic constraints. The performance analysis such as no. of attributes, accuracy, precision and recall shows that proposed weighing based feature selection yields better results than the traditional algorithms.
منابع مشابه
Optimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines
In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...
متن کاملOptimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines
In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...
متن کاملMental Arithmetic Task Recognition Using Effective Connectivity and Hierarchical Feature Selection From EEG Signals
Introduction: Mental arithmetic analysis based on Electroencephalogram (EEG) signal for monitoring the state of the user’s brain functioning can be helpful for understanding some psychological disorders such as attention deficit hyperactivity disorder, autism spectrum disorder, or dyscalculia where the difficulty in learning or understanding the arithmetic exists. Most mental arithmetic recogni...
متن کاملClassification of Right/Left Hand Motor Imagery by Effective Connectivity Based on Transfer Entropy in EEG Signal
The right and left hand Motor Imagery (MI) analysis based on the electroencephalogram (EEG) signal can directly link the central nervous system to a computer or a device. This study aims to identify a set of robust and nonlinear effective brain connectivity features quantified by transfer entropy (TE) to characterize the relationship between brain regions from EEG signals and create a hierarchi...
متن کاملA New Hybrid Method for Improving the Performance of Myocardial Infarction Prediction
Abstract Introduction: Myocardial Infarction, also known as heart attack, normally occurs due to such causes as smoking, family history, diabetes, and so on. It is recognized as one of the leading causes of death in the world. Therefore, the present study aimed to evaluate the performance of classification models in order to predict Myocardial Infarction, using a feature selection method tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015